Artificial Intelligence (AI) refers to the capability of a machine to imitate intelligent human behavior. When talking about Vision recognition, AI allows computers to interpret and understand the visual world. Using digital images from cameras and videos, AI systems can identify and classify objects, recognize patterns, and make decisions. This technology is used in various applications, from facial recognition in security systems to analyzing medical images for diagnostics, which will be the use for this project.
AI systems learn to recognize images through a process similar to how humans learn from experience. The core of this process involves training an AI model using large sets of images. This training involves feeding the AI system examples of images that are already labeled, like pictures of cats labeled as "cat" so the system learns to associate the visual patterns with the label. Over time, the AI learns to recognize these patterns and can identify and classify new images it has never seen before.
There are several types of AI models that can be used for vision recognition, including:
A vision recognition project typically involves several key phases:
Teachable Machine from Google is an easy-to-use web tool that allows users to create machine learning models without any coding experience. It's designed to make AI accessible by enabling anyone to train and deploy models directly from their web browser.
For my week I decided to take a biomedical approach and download a dataset of Brain MRI Images for Brain Tumor Detection and have the machine learn to classify a Tumorous brain MRI and a non-tumorous brain MRI. However there is a BIG DISCLAIMER, this is a academic driven project and is not intended for real medical, diagnose or treatment use.
After this week, I learned a lot about how AI operates, particularly in the image recognition. This knowledge has significantly broadened my perspective and gave me the skills needed for a future improvement for my final project. For this project, I plan to utilize a camera to provide a live feed of the patient's footprint. By integrating AI technology, I can potentially assist physicians in diagnosing various foot health and posture-related pathologies, such as flatfoot or hollow foot. The AI's capabilities in analyzing and recognizing patterns in the live feed could provide valuable insights that would otherwise be challenging to detect.
However, it is essential to emphasize that AI is a powerful tool that complements, rather than replaces, the expertise of physicians. While the AI can process and analyze the data, the ultimate diagnosis and treatment decisions rest with the healthcare professional. The physician will consider the AI's findings along with the patient's comprehensive clinical information to determine the best possible treatment for each pathology.